Japanese Controlled Language Rules to Improve Machine Translatability of Municipal Documents
Abstract
We report on experiments testing the effectiveness of controlled language (CL) rules on texts from Japanese municipal websites. We compiled a set of rules by trial and error, systematically rewriting Japanese source texts and analysing the machine translation (MT) outputs. We then employed native English speakers with little knowledge of Japanese as human evaluators to assess the understandability and accuracy of the English machine-translated text. Comparing the results of four MT systems showed that the effectiveness of CL rules varies with the MT system. A preliminary selection of optimal rules for each system yielded an increase of more than 15% in MT performance. We also assessed the readability of the Japanese source texts and discuss how to make them compatible with the quality of the MT outputs.
Related resources
Readability and Translatability Judgments for ‘Controlled Japanese’
We report on an experiment to test the efficacy of ‘controlled language’ authoring of technical documents in Japanese, with respect both to the readability of the Japanese source and the quality of the English machine-translated output. Using four MT systems, we tested two sets of writing rules designed for two document types written by authors with contrasting professional profiles. We elicite...
Assessing a set of Controlled Language rules: Can they improve the performance of commercial Machine Translation systems?
This paper presents a preliminary evaluation of a set of Controlled Language (CL) rules selected within the framework of a research project on Machine Translation (MT). The main objective of this project is to improve the machine translatability of an English corpus. This corpus has been obtained from a global Internet security technology company, Symantec. The corpus contains XML alert notific...
Two in One – Can it work? Readability and Translatability by means of Controlled Language
According to common understanding the definition of a Controlled Language (CL) depends on its intended use: if improvement in text quality in terms of readability, clarity and comprehensibility is the primary goal of a CL, its rules will differ from CL rules which aim foremost at the improvement of translatability in an automated translation processing environment. In this article the following...
Controlled Korean for Korean-English MT
This paper addresses the issues in designing the so-called ‘Controlled Korean’ for a Korean-English MT system. Controlled Language is a sublanguage of a natural language which is supposed to improve the readability and the translatability of a text. Much effort has been made to design a controlled language for major international languages such as English, German, Spanish and etc. However, litt...
Cognitive Evaluation Approach for a Controlled Language Post-Editing Experiment
In emergency situations it is crucial that instructions are straightforward to understand. For this reason a controlled language for crisis management (CLCM), based on psycholinguistic studies of human comprehension under stress, was developed. In order to test the impact of CLCM machine translatability of this particular kind of sublanguage text, a previous experiment involving machine transl...
Publication date: 2015